NSF PAR Search | NSF Public Access Repository


The 200 Gbps Challenge: Imagining HL-LHC analysis facilities

https://doi.org/10.1051/epjconf/202533701217

Held, Alexander; Albin, Sam; Attebury, Garhan; Bloom, Kenneth; Bockelman, Brian; Bryant, Lincoln; Choi, Kyungeon; Cranmer, Kyle; Elmer, Peter; Feickert, Matthew; et al (October 2025, EPJ Web of Conferences)
Szumlak, T; Rachwał, B; Dziurda, A; Schulz, M; vom Bruch, D; Ellis, K; Hageboeck, S (Ed.)
The IRIS-HEP software institute, as a contributor to the broader HEP Python ecosystem, is developing scalable analysis infrastructure and software tools to address the upcoming HL-LHC computing challenges with new approaches and paradigms, driven by our vision of what HL-LHC analysis will require. The institute uses a “Grand Challenge” format, constructing a series of increasingly large, complex, and realistic exercises to show the vision of HL-LHC analysis. Recently, the focus has been demonstrating the IRIS-HEP analysis infrastructure at scale and evaluating technology readiness for production. As a part of the Analysis Grand Challenge activities, the institute executed a “200 Gbps Challenge”, aiming to show sustained data rates into the event processing of multiple analysis pipelines. The challenge integrated teams internal and external to the institute, including operations and facilities, analysis software tools, innovative data delivery and management services, and scalable analysis infrastructure. The challenge showcases the prototypes — including software, services, and facilities — built to process around 200 TB of data in both the CMS NanoAOD and ATLAS PHYSLITE data formats with test pipelines. The teams were able to sustain the 200 Gbps target across multiple pipelines. The pipelines focusing on event rate were able to process at over 30 MHz. These target rates are demanding; the activity revealed considerations for future testing at this scale and changes necessary for physicists to work at this scale in the future. The 200 Gbps Challenge has established a baseline on today’s facilities, setting the stage for the next exercise at twice the scale.
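To put the quoted figures in perspective, the following is a rough back-of-the-envelope sketch (not taken from the paper) of how long roughly 200 TB would take to read at a sustained 200 Gbps. It assumes decimal units (1 TB = 10^12 bytes, 1 Gbps = 10^9 bits per second), which the abstract does not specify; the function name `transfer_seconds` is purely illustrative.

```python
# Back-of-the-envelope estimate: time to move ~200 TB at a sustained 200 Gbps.
# Assumption (not stated in the abstract): decimal units, i.e.
# 1 TB = 10**12 bytes and 1 Gbps = 10**9 bits per second.


def transfer_seconds(data_bytes: float, rate_bits_per_s: float) -> float:
    """Return the seconds needed to move data_bytes at rate_bits_per_s."""
    return data_bytes * 8 / rate_bits_per_s


DATA_BYTES = 200e12  # ~200 TB, as quoted in the abstract
RATE_BPS = 200e9     # 200 Gbps target rate

seconds = transfer_seconds(DATA_BYTES, RATE_BPS)
print(f"{seconds:.0f} s (~{seconds / 3600:.1f} h)")
```

Under these assumptions, a full pass over the dataset at the target rate takes on the order of a couple of hours, which is consistent with the rate being described as a sustained one.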
Free, publicly-accessible full text available October 7, 2026
Reshaping High Energy Physics Applications for Near-Interactive Execution Using TaskVine

https://doi.org/10.1109/SC41406.2024.00068

Sly-Delgado, Barry; Tovar, Ben; Zhou, Jin; Thain, Douglas (November 2024, IEEE)

High energy physics experiments produce petabytes of data annually that must be reduced to gain insight into the laws of nature. Early-stage reduction executes long-running high-throughput workflows across thousands of nodes spanning multiple facilities to produce shared datasets. Later stages are typically written by individuals or small groups and must be refined and re-run many times for correctness. Reducing iteration times of later stages is key to accelerating discovery. We demonstrate our experience reshaping late-stage analysis applications on thousands of nodes. It is not enough merely to increase scale: it is necessary to make changes throughout the stack, including storage systems, data management, task scheduling, and application design. We demonstrate these changes when applied to two analysis applications built on open source data analysis frameworks (Coffea, Dask, TaskVine). We evaluate the performance of the applications on opportunistic campus clusters, showing effective scaling up to 7200 cores, thus producing significant speedup.
Full Text Available
Mixed Modality Workflows in TaskVine

https://doi.org/10.1145/3588195.3595953

Simonetti, David; Tovar, Ben; Thain, Douglas (August 2023, ACM)

Modern scientific workflows desire to mix several different computing modalities: self-contained computational tasks, data-intensive transformations, and serverless function calls. To date, these modalities have required distinct system architectures with different scheduling objectives and constraints. In this paper, we describe how TaskVine, a new workflow execution platform, combines these modalities into an execution platform with shared abstractions. We demonstrate results of the system executing a machine learning workflow with combined standalone tasks and serverless functions.
Full Text Available
TaskVine: Managing In-Cluster Storage for High-Throughput Data Intensive Workflows

https://doi.org/10.1145/3624062.3624277

Sly-Delgado, Barry; Phung, Thanh Son; Thomas, Colin; Simonetti, David; Hennessee, Andrew; Tovar, Ben; Thain, Douglas (November 2023, ACM)

Many scientific applications are expressed as high-throughput workflows that consist of large graphs of data assets and tasks to be executed on large parallel and distributed systems. A challenge in executing these workflows is managing data: both datasets and software must be efficiently distributed to cluster nodes; intermediate data must be conveyed between tasks; output data must be delivered to its destination. Scaling problems result when these actions are performed in an uncoordinated manner on a shared filesystem. To address this problem, we introduce TaskVine: a system for exploiting the aggregate local storage and network capacity of a large cluster. TaskVine tracks the lifetime of data in a workflow, from archival sources to final outputs, making use of local storage to distribute and re-use data wherever possible. We describe the architecture and novel capabilities of TaskVine, and demonstrate its use with applications in genomics, high energy physics, molecular dynamics, and machine learning.
Full Text Available
PONCHO: Dynamic Package Synthesis for Distributed and Serverless Python Applications

https://doi.org/10.1145/3526060.3535459

Sly-Delgado, Barry; Locascio, Nick; Simonetti, David; Wiseman, Brett; Tovar, Ben; Thain, Douglas (June 2022, High Performance Serverless Workshop at HPDC)

An increasing number of distributed applications operate by dispatching function invocations across the nodes of a distributed system. To operate correctly, the code and data dependencies of the function must be distributed along with the invocations in some way. When translating applications to work on large scale distributed systems, managing these dependencies becomes challenging: delivery must be scalable to thousands of nodes; the dependencies must be consistent across the system; and the method must be usable by an unprivileged developer. As a solution, in this paper we present PONCHO, a lightweight Python-based toolkit that allows users to discover, package, and deploy dependencies as an integral part of distributed applications. PONCHO encapsulates a set of commands to be executed within an environment. PONCHO offers a lightweight solution to create and manage environments, increasing the portability and reproducibility of scientific applications. In this paper, we evaluate PONCHO with real-world applications in the fields of physics, computational chemistry, and hyperparameter optimization. We observe the challenges that arise when creating and distributing an environment and measure the overheads that emerge as a result.
Full Text Available
Dynamic Task Shaping for High Throughput Data Analysis Applications in High Energy Physics

https://doi.org/10.1109/IPDPS53621.2022.00041

Tovar, Ben; Lyons, Ben; Mohrman, Kelci; Sly-Delgado, Barry; Lannon, Kevin; Thain, Douglas (May 2022, IPDPS International Parallel and Distributed Processing Symposium)

Distributed data analysis frameworks are widely used for processing large datasets generated by instruments in scientific fields such as astronomy, genomics, and particle physics. Such frameworks partition petabyte-size datasets into chunks and execute many parallel tasks to search for common patterns, locate unusual signals, or compute aggregate properties. When well-configured, such frameworks make it easy to churn through large quantities of data on large clusters. However, configuring frameworks presents a challenge for end users, who must select a variety of parameters such as the blocking of the input data, the number of tasks, the resources allocated to each task, and the size of nodes on which they run. If poorly configured, the result may perform many orders of magnitude worse than optimal, or the application may even fail to make progress at all. Even if a good configuration is found through painstaking observations, the performance may change drastically when the input data or analysis kernel changes. This paper considers the problem of automatically configuring a data analysis application for high energy physics (TopEFT) built upon standard frameworks for physics analysis (Coffea) and distributed tasking (Work Queue). We observe the inherent variability within the application, demonstrate the problems of poor configuration, and then develop several techniques for automatically sizing tasks to meet goals of resource consumption, and overall application completion.
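As a purely illustrative sketch of what "automatically sizing tasks to meet a resource goal" can look like, the snippet below rescales the number of events per task so that the next task is expected to land near a memory target. This is not the paper's algorithm and does not use the Coffea or Work Queue APIs; the function `next_chunk_size`, its parameters, and the assumption that memory use grows roughly linearly with the number of events are all hypothetical.

```python
# Hypothetical feedback rule for sizing tasks (NOT the method from the paper).
# Assumption: peak memory grows roughly linearly with the events per task,
# so scaling the chunk by target/observed moves the next task toward the target.


def next_chunk_size(
    current_chunk: int,
    observed_memory_mb: float,
    target_memory_mb: float,
    min_chunk: int = 1_000,
    max_chunk: int = 1_000_000,
) -> int:
    """Return the number of events to assign to the next task."""
    if observed_memory_mb <= 0:
        # No usable measurement yet; keep the current size.
        return current_chunk
    scaled = int(current_chunk * target_memory_mb / observed_memory_mb)
    # Clamp so one noisy measurement cannot produce an extreme task size.
    return max(min_chunk, min(max_chunk, scaled))


# A task of 100,000 events used 4000 MB against a 2000 MB target:
# the next task is sized at half as many events.
print(next_chunk_size(100_000, observed_memory_mb=4000, target_memory_mb=2000))
```

The clamping bounds are a design choice for the sketch: they keep tasks from shrinking to trivial sizes or growing past what a single node could hold when measurements are noisy.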
Full Text Available
Lightweight Function Monitors for Fine-Grained Management in Large Scale Python Applications

https://doi.org/10.1109/IPDPS49936.2021.00088

Shaffer, Tim; Li, Zhuozhao; Tovar, Ben; Babuji, Yadu; Dasso, TJ; Surma, Zoe; Chard, Kyle; Foster, Ian; Thain, Douglas (May 2021, IEEE International Parallel and Distributed Processing Symposium)
Python has become a widely used programming language for research, not only for small one-off analyses, but also for complex application pipelines running at supercomputer scale. Modern parallel programming frameworks for Python present users with a more granular unit of management than traditional Unix processes and batch submissions: the Python function. We review the challenges involved in running native Python functions at scale, and present techniques for dynamically determining a minimal set of dependencies and for assembling a lightweight function monitor (LFM) that captures the software environment and manages resources at the granularity of single functions. We evaluate these techniques in a range of environments, from campus cluster to supercomputer, and show that our advanced dependency management planning and dynamic resource management methods provide superior performance and utilization relative to coarser-grained management approaches, achieving a several-fold decrease in execution time for several large Python applications.
Full Text Available
Reduction of Workflow Resource Consumption Using a Density Based Clustering Model

Zhang, Qimin; Tovar, Ben; Kremer-Herman, Nate; Thain, Douglas (January 2018, Workshop on Workflows in Support of Large-Scale Science)

Full Text Available
Deploying High Throughput Scientific Workflows on Container Schedulers with Makeflow and Mesos

https://doi.org/10.1109/CCGRID.2017.9

Zheng, Chao; Tovar, Ben; Thain, Douglas (May 2017, IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing)

Workflows are a widely used abstraction for describing large scientific applications and running them on distributed systems. However, most workflow systems have been silent on the question of what execution environment each task in the workflow is expected to run in. Consequently, a workflow may run successfully in the environment in which it was created, but fail on other platforms due to differences in the execution environment. Container-based schedulers have recently arisen as a potential solution to this problem, adopting containers to distribute computing resources and deliver well-defined execution environments to applications. In this paper, we consider how to connect a workflow system to container schedulers with minimal performance loss and higher system efficiency. As an example of current technology, we use Makeflow and Mesos. We present five design challenges, and address them by using four configurations that connect the workflow system to the container scheduler at different levels of the infrastructure. In order to take full advantage of the resource sharing schema of Mesos, we enable the resource monitor of Makeflow to dynamically update the task resource requirements. We explore the performance of a large bioinformatics workflow, and observe that using Makeflow, Work Queue, and the resource monitor together not only increases the transfer throughput but also achieves the highest resource usage rate.
Full Text Available
